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SELF-DESCRIPTIVE MICROPHONE ARRAY 



BACKGROUND 

10 

Technical Field: 



The invention is related to a microphone array having one or more 
microphones in a predetermined arrangement, and in particular, to a self- 
15 descriptive microphone array that automatically self-calibrates and provides a 
current configuration and state to a computer to which it is connected, so that the 
computer can automatically configure audio processing software to be used for 
processing audio signals captured via the microphone array. 

20 Related Art: 

Conventional microphone array type devices are well known to those 
skilled in the art. In general, microphone arrays typically include an arrangement 
of microphones in some predetermined layout. These microphones are generally 
25 used to capture sounds from various directions and originating from different 
points in space. Once captured, onboard sound processing software and 
hardware then provides sound processing capabilities, such as, for example, 
sound source localization, beam forming, acoustic echo cancellation, noise 
suppression, etc. 

30 

For example, one common use for such arrays involving audio 
conferencing systems is to determine the direction of a dominant speaker in a 
room having both active speech and other noise, and then to process the input 
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from the various microphones in the array accordingly. In particular, given the 
input from each of the microphones in the array, conventional beam forming and 
sound source localization computations are used to localize the position and 
direction of the person currently speaking. With this information, it is then 
5 possible to filter out all sounds not coming from the direction of the speaker, 
thereby improving the overall quality of the captured sound with respect to the 
person speaking. 



Further, many microphone arrays do adaptive beamforming processing 
10 within the array itself. However, acoustic echo cancellation (AEC) processing 
needs to be tightly coupled to any associated adaptive beamforming processing 
in order to work properly. Consequently, when external software applications 
attempt to provide AEC processing in a remote computing device, such as a PC- 
type computer, while performing beamforming computations within the 
15 microphone array itself, the AEC typically fails, or provides sub-optimal results. 
Therefore, such arrays must typically include additional onboard processing 
capabilities, thereby increasing array expense, in order to perform adaptive 
beamforming processing in combination with AEC processing. 



20 As noted above, such microphone arrays typically include onboard 

processing hardware and software within the microphone array itself for 
performing analog and/or digital sound processing. Unfortunately, such onboard 
hardware tends to be significantly more expensive than the actual microphones 
in the array. Further, because the hardware and software is typically included 

25 within the array, updating the hardware and software is often difficult or 
impossible for an end-user of the equipment. 



In addition, every microphone, and thus every microphone within a 
microphone array, tends to have slightly unique properties with respect to 
30 parameters, such as, for example, sensitivity, frequency response, transient 
response, and directivity vs. frequency. This is typically true even of 
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microphones of the same model or type. Therefore, software for interfacing with 
microphone arrays is typically specially designed to operate with particular 
microphone arrays, or includes DLL's or drivers specifically tailored to particular 
microphone arrays that rely on external computers for audio processing. 

5 

For example, with microphone arrays that include onboard processing 
capabilities, the manufacturer typically knows the exact parameters, i.e., 
frequency response curves, etc. of the microphones in the array, and simply 
designs or modifies the software to suit the particular configuration of each 
10 specific array. Similarly, with passive microphone arrays that rely on an external 
computer for sound processing capabilities, the manufacturer of the microphone 
array typically provides software DLL's or drivers which reside on the computer to 
which the microphone array is coupled, and which are designed or modified to 
suit the particular known configuration of the array. 

15 

Unfortunately, where the parameters of the array are either not known by 
the user, or where a user desires to use a particular microphone array with 
software that was not specifically designed to operate with the particular 
microphone array, generic software drivers or DLL's operating on an external 
20 computer for processing audio inputs from the microphone array tend to produce 
sub-optimal audio processing results. 

Further, the operational parameters of individual microphones in a 
microphone array tend to change, if even only slightly, over time. Therefore, 
25 software tailored to a particular microphone array configuration can produce sub- 
optimal audio processing results as the parameters of the microphone array 
change over time. 

Therefore, what is needed is a microphone array that avoids the expense 
30 of onboard audio processing by acting as an inexpensive peripheral device and 
using the computational power of an external computer to which it is connected 
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for processing audio signals. Further, rather than requiring software to be 
specifically pre-tailored to the particular operational parameters of the 
microphone array, the microphone array should instead be operable with 
software that automatically configures itself to the operational parameters of the 
5 microphone array. Consequently, the microphone array should include the 
capability to automatically report those operational parameters to the external 
computer to allow for automatic configuration and optimization of audio 
processing software residing on that computer. 

10 

SUMMARY 

A self-descriptive microphone array, as described herein, includes an 
array of one or more microphones arranged in a predetermined pattern for 

15 capturing sounds. This self-descriptive microphone array operates to solve the 
problems identified above by providing a microphone array memory that is 
integral to the microphone array. The microphone array memory includes any 
type of conventional non-volatile memory, such as, for example a ROM, PROM, 
EPROM, EEPROM, or other conventional memory type or device, which contains 

20 a microphone array device description. This device description includes 
parametric information which defines operational characteristics and 
configuration of the self-descriptive microphone array. In operation, the device 
description of the self-descriptive microphone array is automatically reported to 
an external computing device via a microphone array interface to allow for 

25 automatic configuration of audio processing software residing within the external 
computing device. 

As noted above, the microphone array device description is automatically 
provided to an external computing device, such as a PC-type computer, or other 
30 computing device to which the microphone array is connected. The external 
computing device then uses the device description to automatically configure 
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audio processing software for processing one or more audio signals captured by 
the self-descriptive microphone array. Specifically, software drivers on the 
external computing device, such as DLL's or other software drivers, interpret 
physical parameters of the self-descriptive microphone array that are provided by 

5 the microphone array device description. These software drivers then 

communicate the physical parameter data of the self-descriptive microphone 
array to signal processing software residing within the external computing device. 
This allows the signal processing software to automatically adjust its parameters 
to the characteristics of the attached microphone array to perform automatically 

10 optimized audio processing computations. 

Consequently, because the self-descriptive microphone array makes use 
of external computing power, rather than including onboard audio processing 
hardware and software, the self-descriptive microphone array is relatively 

15 inexpensive to manufacture in comparison to conventional microphone array 
devices that include onboard audio processing capabilities. Further, because 
external processing power is used for audio processing, combined applications 
such as, for example, adaptive beamforming combined with acoustic echo 
cancellation (AEC) can be easily performed without including expensive audio 

20 processing softeare and/or hardware within the array itself. Consequently, one 
major advantage of moving microphone array audio processing to an external 
computing device is that it enables conventional conferencing applications, 
applications, such as, for example Microsoft® Windows® Messenger, or other 
real-time messaging applications, to use microphone arrays such as the self- 

25 descriptive microphone array described herein while significantly reducing 
microphone array costs. 

The connection between the self-descriptive microphone array and the 
external computing device is accomplished using any of a variety of conventional 
30 wired or wireless computer interfaces, including, for example, serial, IEEE 1394, 
USB, IEEE 802.1 1, Bluetooth™, etc., to connect to the external computing 
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device. As noted above, once connected, the self-descriptive microphone array 
provides its device description to the external computing device which then 
automatically configures audio processing software residing within the computing 
device for optimally processing one or more analog or digital audio signals 
5 provided by the self-descriptive microphone array. 

Further, as is well known to those skilled in the art, individual microphone 
operational characteristics, as well as the characteristics of most other electrical 
components, tend to change overtime, and as a function of local temperature. 

10 Consequently, in one embodiment, the self-descriptive microphone array 
includes an integral self-calibration system for automatically determining or 
evaluating at least some of the operational parameters of the microphones and 
associated preamplifiers comprising the self-descriptive microphone array. The 
microphone array device description within the microphone array memory is then 

15 automatically updated to reflect actual configuration of the self-descriptive 
microphone array. 

For example, in one embodiment, the integral self-calibration system is 
capable of automatically determining one or more of the sensitivity and gain (i.e., 

20 magnitude and phase gains) of the individual channels (microphone plus 

preamplifier) of the individual channels (microphone plus preamplifier) in the self- 
descriptive microphone array. In alternate embodiments, the integral self- 
calibration system of the self-descriptive microphone array operates 
automatically either upon connection to the computing device, upon regular or 

25 user-specified intervals, or upon command. 

As noted above, preamplifiers are associated with each microphone in the 
self-descriptive microphone array. Further, to allow for multiple simultaneous 
channels of audio to be captured by the self-descriptive microphone array, one or 
30 analog-to-digital (A/D) converters are also associated with each microphone. 

Audio signals captured by the microphone array are then pre-amplified (i.e., gain) 
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and converted to a digital signal via the A/D converters and provided, via the 
aforementioned wired or wireless computer interface, to the audio processing 
software residing within the external computing device for further processing, as 
desired. The maximum number of digital audio channels that can then be 
5 transmitted via the computer interface is then only limited by the maximum 

bandwidth of that computer interface in combination with the digital sampling rate 
of each channel of the microphone array. 

In another embodiment, in addition to including one or more microphones, 
10 the self-descriptive microphone array also includes one or more speakers for 
reproducing one or more audio signals. For example, many microphone arrays, 
such as those arrays used for audio conferencing, frequently include both 
microphones and speakers. The microphones capture sound, and the speakers 
play back sound. Generally, conventional audio conferencing-type microphone 
is arrays also include relatively expensive onboard acoustic echo cancellation 
capabilities so that local audio signals are not endlessly echoed during an audio 
conference. However, in the context of the self-descriptive microphone array, 
audio processing, such as acoustic echo cancellation, is performed via the audio 
processing software residing within the external computing device. Audio to be 
20 played back via the self-descriptive microphone array is then simply transmitted 
from the external computing device to the array via the aforementioned wired or 
wireless computer interface. 

As with the parametric information defining the microphones within the 
25 self-descriptive microphone array, parametric information defining the speakers 
within the self-descriptive microphone array is also stored within the microphone 
array memory. Configuration of the microphones and the speakers within the 
microphone array are then reported, as noted above, to allow for automatic 
configuration of the audio processing software residing within the external 
30 computing device to which the self-descriptive microphone array is connected. 
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In view of the above summary, it is clear that the self-descriptive 
microphone array provides a unique system and method for automatically 
reporting microphone array device configuration to an external computing device 
for automatic optimization of audio processing software. In addition to the just 
5 described benefits, other advantages of the self-descriptive microphone array will 
become apparent from the detailed description which follows hereinafter when 
taken in conjunction with the accompanying drawing figures. 

10 DESCRIPTION OF THE DRAWINGS 

The specific features, aspects, and advantages of the present invention 
will become better understood with regard to the following description, appended 
claims, and accompanying drawings where: 

15 

FIG. 1 is a general system diagram depicting a general-purpose 
computing device constituting an exemplary system for interfacing with a self- 
descriptive microphone array. 

20 FIG. 2 illustrates an exemplary system diagram showing exemplary 

hardware and software modules for implementing a self-descriptive microphone 
array. 

FIG. 3 illustrates an exemplary architectural layout of a hardware system 
25 embodying a self-descriptive microphone array. 

FIG. 4 provides an exemplary operational flow diagram for illustrating the 
operation of a self-descriptive microphone array. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



In the following description of the preferred embodiments of the present 
invention, reference is made to the accompanying drawings, which form a part 
5 hereof, and in which is shown by way of illustration specific embodiments in 
which the invention may be practiced. It is understood that other embodiments 
may be utilized and structural changes may be made without departing from the 
scope of the present invention. 

10 1-0 Exemplary Operating Environment: 

Figure 1 illustrates an example of a suitable computing system 
environment 100 on which the invention may be implemented. The computing 
system environment 100 is only one example of a suitable computing 
15 environment and is not intended to suggest any limitation as to the scope of use 
or functionality of the invention. Neither should the computing environment 1 00 
be interpreted as having any dependency or requirement relating to any one or 
combination of components illustrated in the exemplary operating environment 
100. 

20 

The invention is operational with numerous other general purpose or 
special purpose computing system environments or configurations. Examples of 
well known computing systems, environments, and/or configurations that may be 
suitable for use with the invention include, but are not limited to, personal 

25 computers, server computers, hand-held, laptop or mobile computer or 
communications devices such as cell phones and PDA's, multiprocessor 
systems, microprocessor-based systems, set top boxes, programmable 
consumer electronics, network PCs, minicomputers, mainframe computers, 
distributed computing environments that include any of the above systems or 

30 devices, and the like. 
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The invention may be described in the general context of computer- 
executable instructions, such as program modules, being executed by a 
computer in combination with hardware modules, including components of a 
microphone array 198. Generally, program modules include routines, programs, 

5 objects, components, data structures, etc., that perform particular tasks or 

implement particular abstract data types. The invention may also be practiced in 
distributed computing environments where tasks are performed by remote 
processing devices that are linked through a communications network. In a 
distributed computing environment, program modules may be located in both 

10 local and remote computer storage media including memory storage devices. 
With reference to Figure 1 , an exemplary system for implementing the invention 
includes a general-purpose computing device in the form of a computer 110. 

Components of computer 110 may include, but are not limited to, a 
15 processing unit 120, a system memory 130, and a system bus 121 that couples 
various system components including the system memory to the processing unit 
120. The system bus 121 may be any of several types of bus structures 
including a memory bus or memory controller, a peripheral bus, and a local bus 
using any of a variety of bus architectures. By way of example, and not 
20 limitation, such architectures include Industry Standard Architecture (ISA) bus, 
Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video 
Electronics Standards Association (VESA) local bus, and Peripheral Component 
Interconnect (PCI) bus also known as Mezzanine bus. 

25 Computer 1 1 0 typically includes a variety of computer readable media. 

Computer readable media can be any available media that can be accessed by 
computer 110 and includes both volatile and nonvolatile media, removable and 
non-removable media. By way of example, and not limitation, computer readable 
media may comprise computer storage media and communication media. 

30 Computer storage media includes volatile and nonvolatile removable and non- 
removable media implemented in any method or technology for storage of 



information such as computer readable instructions, data structures, program 
modules, or other data. 

Computer storage media includes, but is not limited to, RAM, ROM, 
5 PROM, EPROM, EEPROM, flash memory, or other memory technology; CD- 
ROM, digital versatile disks (DVD), or other optical disk storage; magnetic 
cassettes, magnetic tape, magnetic disk storage, or other magnetic storage 
devices; or any other medium which can be used to store the desired information 
and which can be accessed by computer 110. Communication media typically 
10 embodies computer readable instructions, data structures, program modules or 
other data in a modulated data signal such as a carrier wave or other transport 
mechanism and includes any information delivery media. The term "modulated 
data signal" means a signal that has one or more of its characteristics set or 
changed in such a manner as to encode information in the signal. By way of 
15 example, and not limitation, communication media includes wired media such as 
a wired network or direct-wired connection, and wireless media such as acoustic, 
RF, infrared, and other wireless media. Combinations of any of the above should 
also be included within the scope of computer readable media. 

20 The system memory 130 includes computer storage media in the form of 

volatile and/or nonvolatile memory such as read only memory (ROM) 131 and 
random access memory (RAM) 132. A basic input/output system 133 (BIOS), 
containing the basic routines that help to transfer information between elements 
within computer 110, such as during start-up, is typically stored in ROM 131 . 

25 RAM 132 typically contains data and/or program modules that are immediately 
accessible to and/or presently being operated on by processing unit 120. By way 
of example, and not limitation, Figure 1 illustrates operating system 134, 
application programs 135, other program modules 136, and program data 137. 

30 The computer 110 may also include other removable/non-removable, 

volatile/nonvolatile computer storage media. By way of example only, Figure 1 
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illustrates a hard disk drive 141 that reads from or writes to non-removable, 
nonvolatile magnetic media, a magnetic disk drive 151 that reads from or writes 
to a removable, nonvolatile magnetic disk 152, and an optical disk drive 155 that 
reads from or writes to a removable, nonvolatile optical disk 156 such as a CD 
5 ROM or other optical media. Other removable/non-removable, 

volatile/nonvolatile computer storage media that can be used in the exemplary 
operating environment include, but are not limited to, magnetic tape cassettes, 
flash memory cards, digital versatile disks, digital video tape, solid state RAM, 
solid state ROM, and the like. The hard disk drive 141 is typically connected to 
10 the system bus 121 through a non-removable memory interface such as interface 
140, and magnetic disk drive 151 and optical disk drive 155 are typically 
connected to the system bus 121 by a removable memory interface, such as 
interface 150. 

The drives and their associated computer storage media discussed above 
and illustrated in Figure 1, provide storage of computer readable instructions, 
data structures, program modules and other data for the computer 110. In Figure 
1 , for example, hard disk drive 141 is illustrated as storing operating system 144, 
application programs 145, other program modules 146, and program data 147. 
Note that these components can either be the same as or different from 
operating system 134, application programs 135, other program modules 136, 
and program data 137. Operating system 144, application programs 145, other 
program modules 146, and program data 147 are given different numbers here to 
illustrate that, at a minimum, they are different copies. A user may enter 
commands and information into the computer 1 10 through input devices such as 
a keyboard 162 and pointing device 161, commonly referred to as a mouse, 
trackball, or touch pad. 

Other input devices (not shown) may include a joystick, game pad, 
30 satellite dish, scanner, radio receiver, and a television or broadcast video 

receiver, or the like. These and other input devices are often connected to the 
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processing unit 120 through a wired or wireless user input interface 160 that is 
coupled to the system bus 121 , but may be connected by other conventional 
interface and bus structures, such as, for example, a parallel port, a game port, a 
universal serial bus (USB), an IEEE 1394 interface, a Bluetooth™ wireless 

5 interface, an IEEE 802.1 1 wireless interface, etc. Further, the computer 110 may 
also include a speech or audio input device, such as a microphone or a 
microphone array 198, as well as a loudspeaker 197 or other sound output 
device connected via an audio interface 199, again including conventional wired 
or wireless interfaces, such as, for example, parallel, serial, USB, IEEE 1394, 

10 Bluetooth™, etc. 



A monitor 191 or other type of display device is also connected to the 
system bus 121 via an interface, such as a video interface 190. In addition to the 
monitor, computers may also include other peripheral output devices such as a 
15 printer 196, which may be connected through an output peripheral interface 195. 

The computer 110 may operate in a networked environment using logical 
connections to one or more remote computers, such as a remote computer 180. 
The remote computer 180 may be a personal computer, a server, a router, a 

20 network PC, a peer device, or other common network node, and typically 

includes many or all of the elements described above relative to the computer 
110, although only a memory storage device 181 has been illustrated in Figure 1 . 
The logical connections depicted in Figure 1 include a local area network (LAN) 
171 and a wide area network (WAN) 173, but may also include other networks. 

25 Such networking environments are commonplace in offices, enterprise-wide 
computer networks, intranets, and the Internet. 



When used in a LAN networking environment, the computer 110 is 
connected to the LAN 171 through a network interface or adapter 170. When 
30 used in a WAN networking environment, the computer 110 typically includes a : 
modem 172 or other means for establishing communications over the WAN 173, 

i 
i 
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such as the Internet. The modem 172, which may be internal or external, may be 
connected to the system bus 121 via the user input interface 160, or other 
appropriate mechanism. In a networked environment, program modules 
depicted relative to the computer 1 10, or portions thereof, may be stored in the 
5 remote memory storage device. By way of example, and not limitation, Figure 1 
illustrates remote application programs 185 as residing on memory device 181 
It will be appreciated that the network connections shown are exemplary and 
other means of establishing a communications link between the computers may 
be used. 

10 

The exemplary operating environment having now been discussed, the 
remaining part of this description will be devoted to a discussion of the program 
modules and processes embodying a "self-descriptive microphone array" which 
automatically provides configuration information describing physical parameters 
15 of the microphone array to an external computing device for automatic 
configuration of audio processing software and/or hardware. 

2.0 Introduction: 

20 A self-descriptive microphone array, as described herein, includes an 

array of one or more microphones arranged in a predetermined pattern for 
capturing sounds. This self-descriptive microphone array operates to solve the 
problems identified above by providing a microphone array memory that is 
integral to the microphone array. The microphone array memory includes any 

25 type of conventional non-volatile memory, such as, for example a ROM, PROM, 
EPROM, EEPROM, or other conventional memory type or device, which contains 
a microphone array device description. This device description includes 
parametric information that defines operational characteristics and configuration 
of the self-descriptive microphone array. In operation, the device description of 

30 the self-descriptive microphone array is then automatically reported to an 

external computing device via a microphone array interface to allow for automatic 
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configuration of audio processing software residing within the external computing 
device for processing audio signals either captured by the self-descriptive 
microphone array, or in one embodiment, audio signals that are to be played 
back by one or more speakers residing within the self-descriptive microphone 
5 array. 

2.1 System Overview: 

As noted above, the microphone array device description is automatically 
10 provided to an external computing device, such as a PC-type computer, or other 
computing device to which the microphone array is connected. The external 
computing device then uses the device description to automatically configure 
audio processing software for processing one or more audio signals captured by 
the self-descriptive microphone array. Specifically, software drivers on the 
15 external computing device, such as DLL's or other software drivers, interpret 
physical parameters of the self-descriptive microphone array that are provided by 
the microphone array device description. These software drivers then 
communicate the physical parameter data of the self-descriptive microphone 
array to signal processing software residing within the external computing device. 
20 This allows the signal processing software to automatically adjust its parameters 
to the characteristics of the attached microphone array to perform automatically 
optimized audio processing computations. 

Consequently, because the self-descriptive microphone array makes use 
25 of external computing power, rather than including onboard audio processing 
hardware and software, the self-descriptive microphone array is relatively 
inexpensive to manufacture in comparison to conventional microphone array 
devices that include onboard audio processing capabilities. Further, because 
external processing power is used for audio processing, combined applications 
30 such as, for example, adaptive beamforming combined with acoustic echo 
cancellation (AEC) can be easily performed without including expensive audio 
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processing softeare and/or hardware within the array itself. Consequently, one 
major advantage of moving microphone array audio processing to an external 
computing device is that it enables conventional conferencing applications, 
applications, such as, for example Microsoft® Windows® Messenger, or other 
5 real-time messaging application, to use microphone arrays such as the self- 
descriptive microphone array described herein while significantly reducing 
microphone array costs. 

The connection between the self-descriptive microphone array and the 
10 external computing device is accomplished using any of a variety of conventional 
wired or wireless computer interfaces, including, for example, serial, IEEE 1394, 
USB, IEEE 802.1 1, Bluetooth™, etc., to connect to the external computing 
device. As noted above, once connected, the self-descriptive microphone array 
provides its device description to the computing device which then automatically 
is configures audio processing software residing within the computing device for 
optimally processing one or more analog or digital audio signals provided by the 
self-descriptive microphone array. 

Further, as is well known to those skilled in the art, individual microphone 
20 operational characteristics, as well as the characteristics of most other electrical 
components, tend to change over time, and as a function of local temperature. 
Consequently, in one embodiment, the self-descriptive microphone array 
includes an integral self-calibration system for automatically determining or 
evaluating at least some of the operational parameters of the microphones 
25 comprising the self-descriptive microphone array. The microphone array device 
description within the microphone array memory is then automatically updated to 
reflect actual configuration of particular elements of the self-descriptive 
microphone array. 

30 As noted above, preamplifiers are associated with each microphone in the 

self-descriptive microphone array. Further, to allow for multiple simultaneous 
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channels of audio to be captured by the self-descriptive microphone array, 
analog-to-digital (A/D) converters are also associated with each microphone. 
Audio signals captured by the microphone array are then pre-amplified (i.e., gain) 
and converted to a digital signal via the A/D converters and provided, via the 

5 aforementioned wired or wireless computer interface, to the audio processing 
software residing within the external computing device for further processing, as 
desired. The maximum number of digital audio channels that can then be 
transmitted via the computer interface is then only limited by the maximum 
bandwidth of that computer interface in combination with the digital sampling rate 

10 of each channel of the microphone array. 

In this embodiment, the integral self-calibration system is capable of 
automatically determining one or more of the sensitivity and gain (i.e., magnitude 
and phase gains) of the individual channels (microphone plus preamplifier) in the 

15 self-descriptive microphone array. In alternate embodiments, the integral self- 
calibration system of the self-descriptive microphone array operates 
automatically either upon connection to the computing device, upon regular or 
user-specified intervals, or upon command. One advantage of this embodiment 
is that because exact parameters (i.e., impulse response) of components such as 

20 microphones and preamplifiers are determined automatically via self-calibration, 
there is no need for using closely matched, and thus relatively expensive, sets of 
microphones and preamplifiers. Any differences in impulse response between 
individual microphone/preamplifier combinations are simply reported to the 
external computing device for automatically configuring the audio processing 

25 software to optimally respond to the various properties of individual 

microphone/preamplifier combinations within the self-descriptive microphone 
array. 

In another embodiment, in addition to including one or more microphones, 
30 the self-descriptive microphone array also includes one or more speakers for 
reproducing one or more audio signals. For example, many microphone arrays, 
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such as those arrays used for audio conferencing, frequently include both 
microphones and speakers. The microphones capture sound, and the speakers 
play back sound. Generally, conventional audio conferencing-type microphone 
arrays also include relatively expensive onboard acoustic echo cancellation 
5 capabilities so that local audio signals are not endlessly echoed during an audio 
conference. However, in the context of the self-descriptive microphone array, 
audio processing, such as acoustic echo cancellation, is performed via the audio 
processing software residing within the external computing device. Audio to be 
played back via the self-descriptive microphone array is then simply transmitted 
10 from the external computing device to the array via the aforementioned wired or 
wireless computer interface. 

As with the parametric information defining the microphones within the 
self-descriptive microphone array, parametric information defining the speakers 
within the self-descriptive microphone array is also stored within the microphone 
array memory. Configuration of the microphones and the speakers within the 
microphone array are then reported, as noted above, to allow for automatic 
configuration of the audio processing software residing within the external 
computing device to which the self-descriptive microphone array is connected. 

2.2 System Architecture: 

The processes summarized above are illustrated by the general system 
diagram of FIG. 2. In particular, the system diagram of FIG. 2 illustrates the 
25 interrelationships between hardware and software modules for implementing a 
self-descriptive microphone array. It should be noted that any boxes and 
interconnections between boxes that are represented by broken or dashed lines 
in FIG. 2 represent alternate embodiments of the self-descriptive microphone 
array described herein, and that any or all of these alternate embodiments, as 
30 described below, may be used in combination with other alternate embodiments 
that are described throughout this document. 
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In general, the self-descriptive microphone array includes a microphone 
module 200 comprising one or more microphones, such as, for example, 
conventional electret microphones, along with circuitry for amplifying analog 
audio signals captured by the microphone module 200, and for converting the 

5 analog signals to a digital format. In particular, amplification of captured signals 
is provided by a preamp module 210 comprising one or more preamplifier circuits 
which provide gain for amplifying the captured audio signals. An A/D conversion 
module 220 then provides one or more A/D converters for converting analog 
signals captured by the microphones into digital signals for transmission to an 

10 external computing device 290 via a microphone array input/output module 250, 
which provides for conventional data transmission via one of the aforementioned 
wired or wireless computer interfaces. As described below, parametric 
information relating to the gain provided by the preamp module 210 is included 
along with the information stored in a microphone array memory module 230. 

15 

Further, in one embodiment, the microphone module 200, preamp module 
210, and A/D conversion module 220, are combined into one module (not shown) 
in the case of microphones such as a MEMS microphone. For example, as is 
well known to those skilled in the art, a Micro-Electro-Mechanical-Structure 

20 (MEMS) type microphone is basically an integrated circuit, typically very small in 
size, which includes a microphone and preamplifier, and in some cases, A/D 
conversion within a single circuit or microchip. The use of MEMS-type 
microphones in the self-descriptive microphone array described herein allows for 
a further reduction in components by using an integrated circuit which combines 

25 each of the preamp module 210, A/D conversion module 220, and microphone 
module 200 into one module which then provides the operational capability of the 
three separate modules. Consequently, because the same functionality is 
provided by a MEMS-type microphone as is provided by use of the separate 
modules, i.e., the microphone module 200, preamp module 210, and A/D 

30 conversion module 220, the self descriptive microphone array will be described in 
the context of these three modules. However, it should be understood that the 
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use of MEMS-type microphone technology is inherent in the description of these 
three modules. 

As noted above, the self-descriptive microphone array includes the 
5 microphone array memory module 230 for storing and reporting parametric 
information which defines operational characteristics and configuration of the 
self-descriptive microphone array. In general, the memory module 230 uses any 
type of conventional non-volatile memory or storage, such as, for example, ROM, 
PROM, EPROM, EEPROM, etc. The parametric information stored within the 
10 memory module 230 is reported to an external computing device 290, either 
upon connection of the self-descriptive microphone array to the external 
computing device, or upon a manual or automatic request for the information 
originating with the external computing device. As described herein, reporting of 
this parametric information allows for automatic configuration of audio processing 
is software residing within the external computing device 290 for processing audio 
signals either captured by the self-descriptive microphone array, or in one 
embodiment, audio signals that are to be played back by one or more speakers 
residing within the self-descriptive microphone array. 

20 In one embodiment, the parametric information stored in the microphone 

array memory module 230 is maintained in a lookup table which includes 
parametric information describing the configuration of the self-descriptive 
microphone array. In general, this lookup table, or other means of storage, 
includes one or more of the following elements of parametric information: 1) 

25 microphone array manufacturer, model, and version; 2) microphone types and 
position; 3) microphone array working volume (i.e., where the sound source is 
expected to be); 4) microphone gain calibration (inexpensive microphones and 
preamplifier combinations can have a +/- 4dB gain difference due to 
manufacturing variance); and 5) speaker configuration for any speakers included 

30 in microphone array. 
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As noted above, one embodiment of the self-descriptive microphone array 
includes self-calibration capabilities. These self-calibration capabilities are 
provided via a microphone array self-calibration module 240. This microphone 
array self-calibration module 240 automatically determines a current state of one 
5 or more of the components of the microphone array. This current state is then 
used to automatically update the parametric information stored in the microphone 
array memory module 230. Note that the microphone array self-calibration 
module 240 is discussed in further detail below in Section 3. 

10 Further, also as noted above, one embodiment of the self-descriptive 

microphone array includes a set of one or more speakers. This embodiment also 
includes one or more digital-to-analog (D/A) converters and one or more 
amplifiers. In particular, in this embodiment, a D/A conversion module 260 
provides one or more D/A converters for performing digital-to-analog conversion 

15 of one or more digital signals provided by the external computing device 290 via 
the microphone array input/output module 250. An amplifier module 270 then 
provides amplification of the converted analog signals. These analog signals are 
then provided to a speaker module 280 for playback. In particular, the speaker 
module 280 includes one or more speakers for reproducing the amplified analog 

20 audio signals. Again, in this embodiment, the microphone array memory module 
230 further includes parametric information defining physical characteristics of 
the speakers within the self-descriptive microphone array. 

3.0 Operation Overview: 

25 

The above-described hardware and software modules are employed for 
implementing the self descriptive microphone array. As summarized above, this 
self-descriptive microphone array provides automatic reporting of physical 
parameters defining components of the microphone array to an external 
30 computing device. Automatic reporting of these physical parameters then allows 
automatic configuration and optimization of audio processing software residing 
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within the external computing device. The following sections provide a detailed 
discussion of the architecture (FIG. 3) and operation (FIG. 4) of the self- 
descriptive microphone array, and of exemplary methods for implementing the 
hardware and software modules described in Section 2. 

5 

It should be noted that any boxes and interconnections between boxes 
that are represented by broken or dashed lines in either FIG. 3 or FIG. 4 
represent alternate embodiments of the self-descriptive microphone array 
described herein, and that any or all of these alternate embodiments, as 
10 described below, may be used in combination with other alternate embodiments 
that are described throughout this document. 

3.1 Microphone Array Architecture: 

15 The processes described above with respect to FIG. 2 are illustrated by 

the general architectural diagram of FIG. 3. In particular, FIG. 3 illustrates an 
exemplary architectural layout of hardware embodying the microphone array. 
For example, as illustrated by FIG. 3, a self-descriptive microphone array 300 
comprises an array 305 of one or more microphones (310 through 325), a 

20 microphone array memory 340 which contains parametric information that 
defines operational characteristics and configuration of the self-descriptive 
microphone array, and at least one external interface 350, including, for example, 
serial, IEEE 1394, USB, IEEE 802.1 1, Bluetooth™, etc., for connecting the self- 
descriptive microphone array to an external computing device 290. 

25 

Further, the array 305 of microphones included in the self-descriptive 
microphone array 300 includes one or more preamplifiers 330 for providing gain 
or preamplification of each microphone (310 through 325). In a related 
embodiment, the array 305 further includes one or more Analog-to-Digital (A/D) 
30 converters 335 for digitizing an analog audio input from each microphone (310 
through 325). Note that both preamplifiers and A/D converters are well known 
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and understood by those skilled in the art, and will not be described in detail 
herein. 

In another embodiment, the self-descriptive microphone array 300 

5 includes a self calibration system 345 which automatically determines a current 
state of one or more of the components of the microphone array. This current 
state is then used to automatically update one or more of the operational 
characteristics stored in the microphone array memory 340. For example, in one 
embodiment, the self calibration system 345 automatically determines 

10 preamplifier 330 impulse responses. In general, this determination is made by 
providing a "pulse injection circuit" for injecting a precise low-amplitude analog 
pulse at the input of the preamplifier 330. The precise impulse response of the 
preamplifier 330 is then measured for computing frequency-domain 
compensation gains for each preamplifier which serve to provide a consistent 

is output from each amplifier regardless of the operational characteristics of each 
microphone/preamplifier combination. Repeating this process for each 
preamplifier and storing the resulting preamplifier 330 frequency-domain 
compensation gains in the microphone array memory 340 allows for precise 
configuration of audio processing software residing on the external computing 

20 device 290 using the frequency-domain compensation gains for each 
preamplifier. 

One clear advantage of this embodiment is that by knowing a precise 
frequency-domain compensation gain for each preamplifier 330, software drivers 

25 associated with audio processing software residing on the external computing 
device 290 can then easily compensate for phase response mismatches across 
all preamplifiers. Without compensation, such mismatches would reduce the 
performance of certain audio processing applications. For example, the 
performance of conventional beamforming or sound source localization (SSL) 

30 digital signal processing software, which combines all microphone signals to 
provide a relatively narrow capture direction selectivity, will be significantly 
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improved by compensating for the precise phase response of each preamplifier 
330. Note that the self calibration system 345 for the self-descriptive microphone 
array 300 is described in further detail in a copending patent application entitled 
"ANALOG PREAMPLIFIER MEASUREMENT FOR A MICROPHONE ARRAY," 
5 having a filing date of February 4, 2004, and assigned Serial Number 7BD, the 
subject matter of which is incorporated herein by this reference. 

Finally, in yet another embodiment, the self-descriptive microphone array 
300 includes a speaker system 355. In general, this speaker system 355 

10 includes one or more speakers, one or more D/A converters, and one or more 
amplifiers for amplifying analog audio signals prior to playback by the speakers 
included in the speaker system. In this embodiment, audio signals provided by 
the external computing device 290 via the microphone array interface 350 are 
first converted to analog signals, amplified, and then reproduced by providing the 

15 amplified analog audio signals to the speakers of the speaker system 355. 

3.2 Microphone Array Operation: 

In general, as illustrated by FIG. 4, the self-descriptive microphone array 
20 described above operates by first connecting the self-descriptive microphone 
array to the external computing device (Box 400). As noted above, this 
connection is accomplished using a conventional wired or wireless computer 
interface, such as, for example, serial, parallel, IEEE 1394, USB, IEEE 802.11, 
Bluetooth™, etc., for connecting the self-descriptive microphone array to the 
25 external computing device. 

In one embodiment, once connected, self-calibration of the self-descriptive 
microphone array is initiated. In a tested embodiment, this self-calibration is 
performed automatically (Box 410) as soon as the self-descriptive microphone 
30 array is connected to the external computing device (Box 400). In a related 
embodiment, the self-calibration is performed immediately upon manual user 
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request (Box 415), said request being provided from the external computing 
device via the computer interface. In another related embodiment, the self- 
calibration is performed immediately upon an external request (Box 420), such 
as, for example, a request generated by one an audio processing software 

5 program operating on the external computing device. Again, as with the manual 
request (Box 415), the external request (Box 420) is provided from the external 
computing device via the computer interface. In each of these embodiments, i.e., 
automatic, manual, or externally requested self-calibration, the microphone array 
device description automatically updates (Box 425) the microphone array 

10 parametric information 340 to reflect the current state of the microphone array as 
determined via the above-described self-calibration procedure. 

At this point, the parametric information 340 defining the current state of 
the self-descriptive microphone array is reported (Box 435) to the external 

15 computing device. As noted above, given the known operational characteristics 
of the components of the self-descriptive microphone array (i.e., microphone, 
speakers, preamps, etc.), audio processing software operating within the external 
computing device is automatically optimized and/or configured (Box 440) to 
provide a computing environment that is specifically tailored to the known 

20 parameters of the self-descriptive microphone array connected to the external 
computing device. 

For example, assuming a microphone array with two microphones, where 
one of the microphone channels has a gain of +4 dB more than the other 

25 microphone channel, the input received by either of the microphones is then 
weighted by a factor designed to compensate for the difference in gain so that 
the input provided by either of the microphones will be nominally equivalent. As 
a result of such adjustments, conventional processing of audio captured by the 
self-descriptive microphone array is significantly improved relative to audio 

30 processing without weighting the audio inputs to reflect actual microphone array 
configurations. 
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Once the audio processing software has been optimally configured, one or 
more audio signals are captured by the self-descriptive microphone array and 
provided to the external computing device, via the aforementioned computer 
interface, for audio processing, as desired (Box 445). Such processing may 

5 include, for example, conventional sound source localization, beam forming, 
acoustic echo cancellation, noise suppression, etc. Note that as such audio 
processing techniques are well known to those skilled in the art, they will not be 
described in detail herein. Finally, in one embodiment, the device description 
including the microphone array parametric information 340 is updated at any time 

10 during the above-described processes by requesting a self-calibration, either 
manually, or via an external request, as described above. 

However, it should be noted that it is not necessary to update all of the 
parametric information 340 as a part of the self-calibration. For example, certain 

15 parameters may be set, with no need for further updates, during manufacturing of 
the self-descriptive microphone array. For example, information such as the 
manufacturer name, microphone positions, harmonic distortion of microphones or 
speakers, etc., may be included in the parametric information at the time of 
manufacture, without the need to subsequently update such values. Other 

20 parameters, such as, for example, microphone gain, may be updated during the 
self-calibration. 

3.2.1 Microphone Array Lookup Table: 

25 As noted above, a tested embodiment of the microphone array parametric 

information 340 is implemented as a lookup table using an EEPROM. An 
EEPROM or similar rewritable addressable memory is used in this embodiment 
to allow for updating of the lookup table, either in response to microphone array 
self-calibration, or in response to user adjustment of lookup table parameters 

30 from the external computing device via the microphone array interface. 
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As noted above, this lookup table generally includes one or more of 1 ) 
microphone array manufacturer, model, and version; 2) microphone types and 
position; 3) microphone array working volume (i.e., where the sound source is 
expected to be); 4) microphone gain calibration (note that nominally identical 

5 microphones can have on the order of a +/- 4dB gain difference due to 
manufacturing variances); and 5) speaker configuration for any speakers 
included in microphone array. Clearly, additional information may be included in 
the lookup table if it is available. For example, additional parametric information 
that may be useful for configuring particular audio processing software includes 

10 response functions for the microphones in the array; response functions for any 
speakers in the array; wave coefficient tables for each microphone or speaker, 
etc. When available, such information is included in the lookup table, and 
reported to the external computing device as described above. 

15 For example, in a tested embodiment, the parametric information stored in 

the microphone array parametric information 340 includes parametric information 
such as, for example, the type of microphone array, e.g., linear, planar, three- 
dimensional, etc.. Further, the parametric information stored in the microphone 
array parametric information 340 includes number and geometry information 

20 regarding microphones in the array, including the number of microphones in the 
array; a working volume of each microphone in the array, e.g., the working 
elevation and azimuth for audio reception; the type of each microphone, e.g., 
omnidirectional, subcardiod, cardiod, supercardiod, hypercardiod, etc., positional 
information for each microphone in the array, e.g., and the physical location and 

25 orientation of each microphone in a three dimensional space of the array. 

In addition, the parametric information stored in the microphone array 
parametric information 340 also includes any gain associated with each 
microphone in the array. As noted above, in this tested embodiment, the 
30 parametric information stored in the microphone array parametric information 340 
also includes information describing any speakers included in the self-descriptive 
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microphone array. Finally, in one embodiment, additional parametric information 
including a manufacturer name, a microphone array model number, and a 
microphone array version or revision number, were also included within the 
parametric information stored the lookup table representing the microphone array 
5 parametric information 340. 

In further embodiments, additional information is included in the table to 
address particular microphone or microphone array types, parameters, and 
capabilities. For example, additional information that may be incorporated into 

10 the table includes, but is not limited to: microphone and speaker latencies (time it 
takes the PC to send/receive sound), including sampling latencies between 
microphones; phase difference between A/D and D/A conversions for the 
microphone and speakers respectively; frequency response for microphones and 
speakers; harmonic distortion for microphones and speakers; directivity pattern 

15 of microphones in a particular microphone array; terminal coupling loss for 

defining how well the microphones pick up the speaker or speakers in the array; 
and maximum speaker level. 

The table described in the preceding paragraphs is represented by TABLE 
20 1 , which illustrates one example of the lookup table described above as used in a 
tested embodiment of the self-descriptive microphone array. It should be 
appreciated that any desired information may be included in such tables, and that 
TABLE 1 is provided only for purposes of illustration and that the content of this 
exemplary table is not intended to limit the scope of the information included in 
25 such tables. 
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Field 


Description 


Note that the following table entries are used to define manufacturer related information 
of the self-descriptive microphone array. 


Manufacturer 


Manufacturer string 


Model 


Model string 


Version 


Version string 


Note that the following table entries are used to provide a general description of the 
configuration of self-descriptive microphone array. 


Array_Type 


Array type: 0=linear, 1=planar, 2=3D, etc. 


NumJMic 


Number of microphones 


Array_Directivity 


Directivity pattern of microphones within the self-descriptive 
microphone array 


Phase_Difference 


Phase difference between AID and D/A conversion 


Note that the following table entries are used to describe the working volume of the 
microphone array 


Elevation_Min 


Minimum working volume elevation in degrees 


Elevation_Max 


Maximum working volume elevation in degrees 


Azimuth_Min 


Minimum working volume azimuth in degrees 


Azimuth_Max 


Maximum working volume azimuth in degrees 


Note that the following table entries are repeated for each microphone in the self- 
descriptive microphone array. 


MicJType 


Microphone type; the three MSB of this entry define type as follows: 
0= omnidirectional, 1= subcardioid, 2= cardiod, 3= supercardioid, 
4= hypercardiod; other custom values possible. Remaining LSB 
determine the particular microphone: 0 - generic type, other numbers 
are assigned to specific microphone/manufacturer combinations. 


Mic_X_Axis 


Microphone X-axis location within the microphone array 


Mic_Y_Axis 


Microphone Y-axis location within the microphone array 


Mic_Z_Axis 


Microphone Z-axis location within the microphone array 


Mic_Elevation 


Microphone orientation in degrees 


Mic_Azimuth 


Microphone orientation in degrees 


Mic_Gain 


Microphone gain * 1000 


MicJ_atency 


Microphone latency, i.e., the time it takes the external computing 
device to receive audio signals, including sampling latencies between 
microphones 


Mic_Freq_Resp 


Frequency response of the microphone 


MIC_Harmonic 


Harmonic distortion of the microphone 


Mic_Coupling 


Terminal coupling loss of the microphone, i.e., how well the 
microphone picks up the speaker 



TABLE 1 (Continued): Example of a Parametric Information 
Lookup Table for the Self-Descriptive Microphone Array. 



-29- 



Note that the following table entries are repeated for each speaker in the self-descriptive 
microphone array. 


Num_Speakers 


Number of speakers 


Speaker_Type 


Designates speaker directivity and frequency response 


Speaker_X_Axis 


Speaker X-axis location within the microphone array 


Speaker_Y_Axis 


Speaker Y-axis location within the microphone array 


Speaker_Z_Axis 


Speaker Z-axis location within the microphone array 


Speaker_Elevation 


Speaker orientation in degrees 


Speaker_Azimuth 


Speaker orientation in degrees 


Speaker_Latency 


Speaker latency, i.e., the time it takes the external computing device to 
send audio signals to speakers in microphone array 


Speaker_Max_Level 


Maximum speaker level, e.g., dB sound pressure level (SPL) at 1 
meter. 


Speaker_Freq_Resp 


Frequency response of the microphone 


Speaker_Harmonic 


Harmonic distortion of the microphone 



TABLE 1 (Continued): Example of a Parametric Information 
Lookup Table for the Self-Descriptive Microphone Array. 



The foregoing description of the self-descriptive microphone array has 
been presented for the purposes of illustration and description. It is not intended 
to be exhaustive or to limit the invention to the precise form disclosed. Many 
modifications and variations are possible in light of the above teaching. Further, 
it should be noted that any or all of the aforementioned alternate embodiments 
may be used in any combination desired to form additional hybrid embodiments 
of the self-descriptive microphone array. It is intended that the scope of the 
invention be limited not by this detailed description, but rather by the claims 
appended hereto. 
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